Unsupervised domain adaptation with non-stochastic missing data

نویسندگان

چکیده

We consider unsupervised domain adaptation (UDA) for classification problems in the presence of missing data unlabelled target domain. More precisely, motivated by practical applications, we analyze situations where distribution shift exists between domains and some components are systematically absent on without available supervision imputing components. propose a generative approach imputation. Imputation is performed domain-invariant latent space leverages indirect from complete source introduce single model performing joint adaptation, imputation which, under our assumptions, minimizes an upper bound its generalization error performs well various representative divergence families ( $$\mathscr {H}$$ -divergence, Optimal Transport). Moreover, compare adaptation-imputation framework “ideal” UDA classifier Our further improved with self-training, to bring learned class posterior distributions closer. perform experiments three datasets different modalities: classical digit benchmark, Amazon product reviews dataset both commonly used real-world digital advertising datasets. show benefits jointly these

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised Learning with Non-Ignorable Missing Data

In this paper we explore the topic of unsupervised learning in the presence of nonignorable missing data with an unknown missing data mechanism. We discuss several classes of missing data mechanisms for categorical data and develop learning and inference methods for two specific models. We present empirical results using synthetic data which show that these algorithms can recover both the unkno...

متن کامل

Unsupervised Transductive Domain Adaptation

Supervised learning with large scale labeled datasets and deep layered models has made a paradigm shift in diverse areas in learning and recognition. However, this approach still suffers generalization issues under the presence of a domain shift between the training and the test data distribution. In this regard, unsupervised domain adaptation algorithms have been proposed to directly address t...

متن کامل

Unsupervised Domain Adaptation with Feature Embeddings

Representation learning is the dominant technique for unsupervised domain adaptation, but existing approaches often require the specification of “pivot features” that generalize across domains, which are selected by task-specific heuristics. We show that a novel but simple feature embedding approach provides better performance, by exploiting the feature template structure common in NLP problems.

متن کامل

Unsupervised Domain Adaptation with Residual Transfer Networks

The recent success of deep neural networks relies on massive amounts of labeled data. For a target task where labeled data is unavailable, domain adaptation can transfer a learner from a different source domain. In this paper, we propose a new approach to domain adaptation in deep networks that can jointly learn adaptive classifiers and transferable features from labeled data in the source doma...

متن کامل

Unsupervised Multi-Domain Adaptation with Feature Embeddings

Representation learning is the dominant technique for unsupervised domain adaptation, but existing approaches have two major weaknesses. First, they often require the specification of “pivot features” that generalize across domains, which are selected by taskspecific heuristics. We show that a novel but simple feature embedding approach provides better performance, by exploiting the feature tem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data Mining and Knowledge Discovery

سال: 2021

ISSN: ['1573-756X', '1384-5810']

DOI: https://doi.org/10.1007/s10618-021-00775-3